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Foreword 

What is the use of a book without pictures or conversations? 

The current language of mathematics is primarily object-oriented. 
A classical textbook on a given subject takes some notion, such as 
that of a singular integral operator or an abelian group and tries to 
say as much as possible about it using whatever tools are needed for 
each particular lemma or proposition. The hope seems to be that the 
reader will be fascinated enough with the wide landscape that slowly 
unfolds before his eyes to learn to jump over trenches, climb trees and 
slopes, and wade through thickets and streams essentially on his own 
after watching his guide doing it effortlessly and with elegance. This 
certainly does make sense and most of us have learned mathematics 
exactly in this way. 

However, there are at least a few of us, who look at the marble 
palaces of mathematics not from the viewpoint of an artist or a town 
architect, but from that of a mason, a carpenter, or an engineer. When 
looking at an equine statue, such a person marvels not so much at 
the flaring nostrils of the steed or the majestic posture of the rider, but 
rather at the amazing balance that allows the whole heavy composition 
to rest on its foundation touching it only by the two rear hoofs of the 
horse. 

If you look at mathematics, or anything else, from this point of 
view, another landscape will open to your eyes: not that of towers 
rising to the sky, wide streets and blossoming gardens, but that of the 
intricate machinery that makes the whole city tick in unison: water 
pumps, electric lines, networks of tunnels and bridges, and so on up 
to the concave shape of the pavements and the sewer ditches around 
them. 

The description of this second landscape in an object-oriented lan¬ 
guage is pretty dull. Indeed, what can be said about a sewer ditch as 
an isolated object that is interesting enough to merit a discussion? Its 
beauty reveals itself only in the way it works in a heavy rain keeping 
the glossy boulevards and the shiny buildings around it safe from floods 
that would otherwise wash away both the speedy chariots and the no¬ 
ble inhabitants. In other words, its whole tale is in the way it operates, 
so a function-oriented language would be the best one to employ in this 
case. The textbooks written in this language are few and we decided 
that it might make sense to try our own hand at writing one. 

Another common feature of many modern textbooks is their ency¬ 
clopedic character. The authors probably feel like they are obliged to 
squeeze the whole body of human knowledge into a single volume. The 
resulting 400-1- page leviathans are perfect reference books when one 
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needs to check if some particular statement is true or known, but if 
one wants to taste the flavor of the subject for the first time, the sheer 
magnitude of the proposed endeavor of reading a few hundred pages of 
(usually quite laconic) text can easily scare him away. 

Hence, instead of trying to present everything we knew about the 
dyadic techniques, we tried to hang a thin thread from the very basics 
to one of the most recent results about the weighted norm inequalities 
for singular integral operators. If these words sound a bit intimidating, 
there is no need to worry. If the reader assumes that all functions and 
kernels are actually continuous in the entire space and then uses the 
usual Riemann integration theory instead of the Lebesgue one, he will 
lose next to nothing. 

The resulting text of approximately 50 pages is as self-contained as 
it can be and we hope that the reader may be tempted to take a look 
at something this short. It is also worth mentioning that quite a few of 
those 50 pages are devoted to explanations of why this or that should 
be done in some or other particular way. Without these explanations 
and the illustrations this book would shrink even further. 

We are aware of the criticism that each person visualizes any par¬ 
ticular argument in his own way and that the commonly used language 
that carefully avoids any informal passages is designed this way pre¬ 
cisely to eliminate the superposition of any subjective views or associ¬ 
ations onto the pristinely objective chains of syllogisms. However, we 
have never seen those pristine chains anywhere. The way every single 
dehnition or statement is composed already reveals if not the person¬ 
ality of the author, then at least the school to which he belongs. Most 
important, the author’s personality is inseparable from the way he puts 
the whole argument together, so the choice is really between presenting 
a highly personal and subjective structure of the proof without any la¬ 
bels or with labels explaining what, in the opinion of the author, each 
joint or block is there for. 

Of course, when looking at works of a highly skilled craftsman with 
ultimately weird internal logic, the first option may be preferable, but in 
general to strip the labels away is easier than to restore them. We thus 
felt free to supplement the formal arguments with informal comments 
any time we wanted. We also included pictures and did not hesitate 
to change the existing terminology slightly or to introduce a new one 
when we thought that it might help to convey some particular point 
more efficiently. 

The title comes partly as a challenge to some modern calculus text¬ 
book writers whose 3-pound bricks, all coming out of the same kiln 
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at the speed of a Ford car plant conveyor belt, would constitute a de¬ 
cent weapon in a fray^ but are of little assistance in a classroom (our 
hats are off for the likes of M. Spivak, though). The main problem 
is that those textbooks teach just to follow standard algorithms, thus 
viewing a student as a (rather badly wired and slow) computer to pro¬ 
gram, never appealing to such natural human feelings as curiosity and 
admiration of the unfamiliar. While in many these feelings are long 
dead and difficult to resurrect, the flickering light of the magic lamp of 
human wit and intelligence is not yet totally obscured by the infernal 
hres of regional wars or extinguished by the brainwashing waterfalls of 
mass media. We wanted to add some oil to that lamp. Whether and 
to what extent we have succeeded or failed is left to the judgement of 
the reader. 

Summer 2014, Kent-Ramat Gan 


^if not for the glossy covers that slip out of one’s fingers too easily, though 
cutting a hand-size hole near one of the edges can partially remedy the issue and 
make the book easier to carry around in times of peace 
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1. Introduction 

The dyadic technique is a game of cubes, and this is the way we 
try to present it. We start the general theory with the basic notion of 
a dyadic lattice, proceed with the multiresolution, the Three Lattice 
Theorem (probably due to T. Hytonen), augmentation and stopping 
times, and hnish the exposition with Carleson families. 

The point of view we are trying to promote differs from the standard 
one in only one respect: we try to exhibit various standard families of 
cubes as whole game sets, which can be reshuffled and complemented 
freely in the related constructions as the need arises, rather than dehn- 
ing them recursively and going from one generation to another, check¬ 
ing the properties by (backward) induction. This point of view led us 
to the introduction of an explicit graph structure on a dyadic lattice 
and a couple of related notions, which are almost always mentioned 
in informal conversations between specialists under various names, but 
had hardly ever been formalized in writing. 

The particular application we chose to demonstrate how the dyadic 
cubes can be played with in analysis is weighted norm inequalities for 
singular integral operators. The high points of this second half of the 
book are the estimate for an arbitrary measurable function in terms 
of its A-oscillations on a Carleson family of cubes, a pointwise bound 
that allows one to easily reduce the weighted inequalities for Calderon- 
Zygmund operators to those for sparse operators, and a unihed ap¬ 
proach to the linear and multilinear theory. 

The (small) cost to pay is that the formalization of the multiweight 
problem most suitable for our exposition is somewhat different from 
the commonly used one, though the translation from our language to 
the standard one is immediate, and we state the main result in this 
book (the “Ap-conjecture” about the sharp dependence of the opera¬ 
tor norms in weighted spaces on the joint Muckenhoupt norm of 
weights) in both forms in the hnal sections. 

The history of dyadic techniques and weighted norm inequalities 
merits a separate volume. If such a monograph were ever written it 
would probably be longer than our entire exposition even if restricted 
to stating merely who did what and when, omitting all subtle interplays 
of events and ideas. We highlight a few relevant works most directly 
connected with our presentation in the short Historical Notes section 
near the end of this book, but we are very far from claiming that our 
selection is complete or even representative. 

Acknowledgements. We are thankful to numerous people who 
read the preliminary version of this manuscript and shared their re¬ 
marks and suggestions with us. This project would also be difficult to 


carry out without the genereous support of the Israel Science Founda¬ 
tion and the National Science Foundation 

2. Dyadic cubes and lattices 

By a cube in M” of sidelength I we always mean a half-open cube 

Q = [xi, Xi + 1) X ■ ■ ■ X [Xn, Xn + /) {x = (Xi, . . . , Xn) G M"', / > 0) 

with sides parallel to the coordinate axes. We will call the point x the 
“corner” of the cube Q and denote it by x{Q). It will be also convenient 
to introduce the notation c{Q) for the “center” x + ,..., 1) of Q and 

£q for the sidelength of Q. 

Let Q be any cube in M”. A (dyadic) child of Q is any of the 2"' 
cubes obtained by partitioning Q hy n “median hyperplanes” (i.e, the 
hyperplanes parallel to the faces of Q and dividing each edge into 2 
equal parts). 


Qoi 

Qii 1 


'c{Q) 1 

Qoo 

Qio 1 


x{Q) 


Figure 1. A square Q and its 4 dyadic children Qp-. 


Passing from Q to its children, then to the children of the children, 
etc., we obtain a standard dyadic lattice V^Q) of subcubes of Q (see 
Figure 2). 

The cubes in this lattice enjoy several nice properties of which the 
most important ones seem to be 

(i) for each k = 0,1,2,, the cubes in the k-th generation T>k{Q) 
have the same sidelength 2' and tile Q in a regular way, 
i.e., in the same way as the integer shifts of [0,1)” tile M"'; 

(ii) each cube Q' G 'Dk{Q) has 2"' children in Tl>k^i{Q) contained in 
it and (unless it is Q itself) one parent in Vk-i{Q) containing 
it; 

^ISF grant 953/13 (A.L.) and NSF grant DMS 080243 (F.N.) 
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HQ) 

HQ) 

HQ) 

HQ) 

HQ) 



Figure 2. The top 5 generations in the dyadic lattice 'D{Q). 

(hi) for every two cnbes Q'.Q" G T>{Q), either Q' fl Q" = 0, or 

Q' C g", or g" C g'; 

(iv) if Q' e V{Q), then V{Q') C V{Q). 

The dyadic lattice T>{Q) has many other advantages bnt two es¬ 
sential drawbacks: it is completely rigid and covers only a part of the 
entire space. All that can be done to compensate for these drawbacks 
is to move it aronnd and scale, which is enongh for most pnrposes bnt 
still forces one to nse awkward phrases like “considering only fnnctions 
with compact snpport and choosing Q so that it covers the snpport” 
or “assnming that J / = 0 to avoid writing the top “mean” term in 
the Haar decomposition of / separately”. 

Onr hrst goal will be to introdnce a notion of a dyadic lattice & 
that takes care of all M” at once. Onr particular choice of axioms was 
dictated by the necessity to ensure that certain constructions be pos¬ 
sible and the desire to make the property of being a lattice reasonably 
easy to use or verify. The cost is that the “classical” dyadic lattice 

V = mi -|- 1) X • • • X [m„, m„ -|- 1)) : A: e Z, mi,..., m„ G Z} 

is not a dyadic lattice in our sense, though any hnite set of cubes in T) 
is a subset of some (actually, inhnitely many) dyadic lattices 

Definition 2.1. A dyadic lattice 3i in is any collection of cubes 
such that 

(DL-1) if g G then each child of g is in ^ as well (this, of course, 
implies immediately that T>{Q) C ^); 
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(DL- 2 ) every 2 cubes Q'iQ" G Qi have a common ancestor, i.e., there 
exists Q E ^ such that Q', Q" G T>{Q)] 

(DL-3) for every compact set K C M", there exists a cube Q E ^ 
containing K. 

Property (DL-2) can be used several times in a row to hnd a com¬ 
mon ancestor of any hnite family of cubes by hrst hnding an ancestor 
Q 2 of Qi and Q 2 , then an ancestor Q 3 of Q 2 and Qs, etc. It also ensures 
that all cubes in ^ lie in a usual neat way with respect to each other, 
so all useful properties of cubes in V^Q) hold in ^ as well except the 
existence of the “very top cube”, which is usually a nuisance rather 
than an asset. 

We can also split the cubes in ^ into generations (k G Z) by 
choosing an arbitrary cube Qq E S) and declaring it of generation 0. 
The generation of any other cube Q can be then determined by hnding 
some common ancestor P oiQ and Qo and taking the difference of the 
generations of Q and Qq in T>{P). 

It follows immediately from the way the generations are dehned 
that all cubes in have the same sidelength 2 ' -%o- Property (DL-3) 
and the common ancestor trick also imply that the cubes in file IP"'. 

We always think that the generation number increases as the cubes 
shrink and decreases as they expand, so the parent of Q E is in 
^k-i and the children of Q are in ^k+i- 

Once the dehnition is given, our hrst task is to demonstrate that 
dyadic lattices do exist. It is not hard and the idea is the same as 
in the construction of the classical dyadic lattice: take any cube Q, 
construct the dyadic lattice D(Q), and then expand it inductively up 
and sideways by choosing one of 2 ” possible parents for the top cube and 
including it into ^ together with all its dyadic subcubes during each 
step. The only catch is that the choice should be done either carefully 
enough, or recklessly enough (both approaches work) to ensure that we 
do not just move in one direction all the time, like in the classical lattice 
where we always expand the top cube from its corner. The particular 
way of expansion we will use is alternating the expansions from the 
corner with those from the vertex opposite to the corner. 

This is by no means the only way to go. Starting with any cube 
Qo and the corresponding dyadic lattice V{Qo), construct any ascend¬ 
ing sequence of cubes Qq C Qi C Q 2 P ■ ■ ■ so that Qj is a dyadic 
child of Qj+i and UjintQj = M”. Note that then Qj E P{Qj+i) and, 
thereby V{Qj) C D(Qj+i). Put S' = [JY=iP{Qj)- Then S' satishes all 
conditions of Dehnition 2.1. 

(DL-1) If Q G then Q E P{Qj) for some j, so all children of Q are 
in the same ViQj) C 
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Figure 3. The expanding families of intervals and squares. 


(DL-2) Let Q'iQ” G Then Q' G V{Qji) and Q” G V{Qjii) for some 

j', j". The cube QmaxO'j") G ^ is a common ancestor of Q', Q". 
(DL-3) intQj is an expanding chain of open sets covering the entire 
space, so any compact will get absorbed sooner or later. 


The chain depicted on Figure 3 is 

[-2^2^+!)^, j is odd; 


Qj = 


[_2t+i^ 2-^)"', j is even. 


It clearly satishes all requirements, but most other sequences will do 
just as well. 


2.1. Dyadic lattice as a multiresolution. Any dyadic lattice 
^ consists of a sequence of generations The cubes in form a 
regular tiling of M". More precisely: 

(*) if we know any cube Q G T^, then all cubes in are given by 
x{Q) + IqU + igQo, where u G Z"' and Qo = [0,1)"'. 

The tilings and agree with each other in the sense that every 
cube in is a child of a unique cube in T^, so T^+i is a rehnement of 
Tfc. Property (DL-3) can be restated in terms of tilings Tk in literally 
the same way: every compact K C M"' is contained in some cube in 
one of the tilings T^. 

This all makes it tempting to look at dyadic lattices from another 
point of view and to introduce the following dehnition. 


Definition 2.2. Let T = ^k&iTk be a system of cubes comprised 
of regular (in the sense of (* *)) tilings of M”. We say that T is a 
dyadic multiresolution of M”' if 

(DM-1) for every fc, the tiling T^+i consists exactly of dyadic children 
of cubes in T^; 
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(DM-2) for every compact K C M"", there exists some G Z and 
Q ^ Tk such that Q D K. 

Remark 2.3. Condition (DM-1) may be quite unpleasant to check 
directly. However, there is a nice simple criterion for it to hold: if we 
can hnd a cube Q = x{Q) + IqQq G such that its “corner child” 
x{Q) + ^Qo is in Tfc+i, then (DM-1) holds. 

Indeed, then every cube Q' G Tk+i can be represented as 

x{Q) + -|- -^Qo, u G Z"'. 

Writing u = 2n n G Z”, £ G {0,1}” and observing that the dyadic 
children of a cube R are x{R) -|- -|- ^Qo, s G {0,1}", we see that Q' 

is a child of x{Q) + iqv + iqQo G T^. 

Note that for all k, if the tiling T^+i consists of dyadic children of 
cubes in T^., then Tk +2 consists of dyadic grandchildren of cubes in T^, 
etc., so Tfc+m = UQgT'fc^^m(Q) where RmiQ) is the m-th generation of 
T>{Q). Since the cubes in are pairwise disjoint, this implies, in par¬ 
ticular, that for two cubes Q E and Q' G Tk+m in a multiresolution, 
the conditions Q' C Q and Q' G T>{Q) are equivalent. 

Now we are ready to show that every dyadic multiresolution is a 
dyadic lattice. Indeed, if Q G T^. for some k, then every child of Q 
belongs to Tk+i, establishing (DL-1). Property (DL-3) is the same as 
property (DM-2). Finally, if Q',Q" G T, then, by (DM-2), there exists 
Q E T such that Q', Q" C Q. But, as we have seen above, this is 
equivalent to Q'^Q” E T>{Q), establishing (DL-2). 

2.2. The baby Lebesgue differentiation theorem. Dyadic lat¬ 
tices are often used in analysis to discretize some statements and argu¬ 
ments about functions on M"'. The general idea is that every reasonable 
function / : M” —)■ M is constant up to a negligible error on sufficiently 
small cubes Q E 2i. This assertion can be understood literally if / 
is continuous. If the function / is merely measurable, this general 
principle should be interpreted with some caution. The particular for¬ 
mulation we present in this section is not the strongest one but it is 
easy to prove and suffices for all purposes of this book. 

Let / be a measurable function that is hnite almost everywhere. 
We say that x G M" is a (dyadic) weak Lebesque point of / if for every 
e, A > 0, we have 

\{yEQ-. \f{y) - f{x)\ > £}| < A|Q| 
for at least one cube Q E containing x. 

Theorem 2.4. Almost every point x E MA is a weak Lebesgue point 
off. 
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Proof. Fix e, A > 0 and consider the set of “bad” points x e 
for which for every cube Q E containing x, there is a set U{Q, x) (Z Q 
of measure \U{Q,x)\ ^ A|(5| such that \f — f{x)\ ^ e on U{Q,x). For 
a G M, i? > 0, let 

E{a, R) = {x : |x| ^ R, \ f{x) — a| < e/3}. 

Note that = M", so it is enough to show that the 

outer Lebesgue measure n of the set F'bad(a,-R) of the bad points in 
each E{a, R) is 0. Assume p > 0. By the definition of the outer 
Lebesgue measure, we can choose a dyadic cover^ F'bad(a 5 R) C Uq^sQ 
{S C with \Q\ < (1 + X)fi. We can also remove from S 

all cubes Q disjoint with i?bad(a,-R)- But then for each cube Q G 
S, we can choose a point x{Q) G Q fl F'bad(a 5 -R) and note that the 
corresponding set U{Q, x(Q)) is disjoint with E{a, R) because all values 
of / on f/(Q, x(Q)) differ by at least e from/(a:(Q)) G {a—e/3,a+e/3). 
Thus, 

A* < ^ 10 \ U(Q,x{Q))\ < (1 - A) ^ 101 < (1 - 

QgS QgS 

which is a clear contradiction. □ 


3. The Three Lattice Theorem 


Quite often we can easily estimate some quantities in terms of av¬ 
erages /q = Jq / of positive functions over some cubes in M” but 
not necessarily over dyadic cubes. 

We would like to estimate these averages by the averages of the 
same kind but taken over the cubes in some dyadic lattice ^ only. To 
this end, note that ii Qi Z Q Z Q 2 , then 


1 ^ 

IQI 


/Qi ^ /q ^ 


IQ2I r- 
\Q\ 


Thus, our task can be accomplished with decent precision if for an 
arbitrary cube Q, we can find a pair of dyadic cubes Qi, Q 2 E S> such 
that Qi Z Q Z Q 2 and the volume ratios and are not too small 
or too large respectively. Finding Qi is never a problem: just take the 
cube in S> containing c{Q) whose sidelength is between ^q/ 4 and ^q/2 . 
However, it may easily happen that we need to go quite high up in the 
dyadic lattice to meet the first single cube covering Q (see Figure 4). 


^The standard definition of the Lebesgue outer measure uses arbitrary paral¬ 
lelepipeds, but each parallelepiped can be covered by finitely many dyadic cubes of 
the total volume as close to the volume of the parallelepiped as one wants. 
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Figure 4. The smallest cube Q 2 E ^ containing Q can 
be arbitrarily many times larger than Q. 

We see that we can cover Q by the union of few adjacent cubes of 
comparable size though. More precisely, if we take the dyadic cube Q 2 
containing x{Q) and such that £q/2 < £q^ ^ £q, then 

Q c Q 2 = x{Q2) + ^{Q2 ~ x{Q2)) 

(this cube with the same corner as Q 2 but of triple size is comprised 
of 3" lattice cubes). Unfortunately, the set ^ = {Q : Q E is not 
a dyadic lattice. Or is it? No, of course not: the cubes in ^ overlap 
in fancy ways, which is not allowed for cubes in one lattice. In one 
lattice? And now the next logical step is inevitable: is not a single 

lattice, indeed, but it is a union of several (3"') dyadic lattices. 

Theorem 3.1. (The Three Lattice Theorem) For every dyadic lat¬ 
tice , there exist 3"' dyadic lattices ..., such that 

3 " 

^ = {x{Q) + 3(g - x{Q)) : g e ^} = U 

i=i 

and for every cube Q E and j = 1,..., 3"', there exists a unique cube 
R E of sidelength £^ = Mq containing Q. 

Remark 3.2. An immediate corollary to the Three Lattice The¬ 
orem is that for every m = 1 , 2 ,..., there exist 3™'"' dyadic lattices 
^( 1 ),..., such that for each set of arbitrary cubes gi,..., Qm, 

one can hnd i G {1, ..., 3™'"'} and cubes gi, ..., Qm E so that for 
each k = 1,... ,m, one has Qk D Qk and ig^l ^ 3™'"'|gfc|. 

This can be easily proved by induction on m. The base case m = 1 
immediately follows from the discussion before the statement of the 
theorem. Just take any dyadic lattice ^ and consider the lattices 
^U) given by the hrst assertion of the theorem. Note that the second 
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assertion has not been used for this case though it will be crucial for 
the induction step. 

Assume now that the claim holds for some m and ^(1), . . . , 
are some dyadic lattices satisfying the required property. Applying the 
Three Lattice Theorem to each of these lattices, we get a new set of 
3 (m+i)n lattices so that for each i, the set 

%) = {x{Q) + 3(g - x{Q)) : Q e %)} 

is the union of j = 1,...,3”'. To finish the induction step, it 
will suffice to show that this new set of lattices satisfies the required 
property for m + 1. 

Take any cubes Qi,, Qm+i- By the induction assumption, we can 
hnd i and some cubes Q'j^ G (/c = 1,..., m) so that Q'^. D Qk and 
IQfcl ^ 3™'"'|gfc|- This leaves us with the last cube Qm+i to take care of. 
As it was observed earlier, we can find j and a cube Qm+i G so that 

Qm+l ^ Qm+1 and \Qm+i\ ^ ^ it remains to 

note that, by the second assertion in the Three Lattice Theorem, we 
can extend each cube Q'f. G .^(q to a 3 times larger cube Qk G 

Proof of Theorem 3.1. We start with a picture in Mb 


^k+i 








Figure 5. The lattice ^ and three lattices shown 
at two consecutive generations. 


For the formal argument, it will be convenient to assume that Qo = 
[0,1)” G S'o (which can always be achieved by translation and scal¬ 
ing). Then every other cube Q E can be represented as x -|- 2~'^Qq, 
where x is a dyadic rational vector; more precisely, 2™'x G Z” for 
m ^ max(/c,0). Note that the “corner child” of Q = x -f 2“^go ^ 
is X 2“(*'+^)go e ^k+i and that all other cubes in are given by 
X-f 2 -^m + 2-^Qo {u G Z"). 
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Consider the generation = {x -h 3Qo : x G Z^j of triple cubes 
corresponding to cubes in Any cube Q = x + 3Qo can (and must) 

be put into the same dyadic lattice with all cubes Q' = x' + SQo 
for which x — x'& (SZ)”. This gives us an easy and natural way to split 
one generation into 3” tilings. Introduce the equivalence relation 
on Z" by x ~ 1 / a; — 1 / G (3Z)”. The cubes x + 3(50, y + 3Qo £ 
are put in the same tiling if and only if x ^ y. 

Since 2 and 3 are coprime, the divisibility of an integer by 3 is not 
affected by multiplication or division by 2, so we can easily extend this 
equivalence relation to the entire set of corners of all dyadic cubes in 
S) saying that the corners x' and x” of two cubes in Si are equivalent 
if 2'^{x' — x") G (dZ)” for sufficiently large m. Since our equivalence 
relation is invariant under dyadic scaling, we see that the cubes Q E Sk 
with corners from one equivalence class tile M” for each k E Z the same 
way as it was for k = 0. Moreover, each cube Q G is contained in 
some cube of this tiling. 



Figure 6. The tilings of by triple cubes with corners 
in two different equivalence classes (blue and red). 


We have already seen that there are 3” equivalence classes in each 
generation. Note now that if we have 3"’ + 1 dyadic rational vectors Xj 
and take m so large that 2'^Xj E Z” for all j, then by the pigeonhole 
principle, the difference of some two of the integer vectors 2"^Xj is in 
(3Z)”, so there cannot be more than 3" equivalence classes total. Thus 

(a) there are exactly 3"' equivalence classes £j] 

(b) for each k, the cubes Q E Sk with corners in the same equiva¬ 
lence class Sj tile M” in a regular way and the original tiling of M” by 
the cubes Q G is a rehnement of this tiling. 
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Denote ^ 

= {Q e ^ : x(Q) e Sj}. 

We will show that each is a dyadic multiresolution. Property 
(b) already establishes that each satishes (DM-2): just take any 
compact iP C M”, hnd a cube Q in some with Q D K, and then 
take the cube Q E S>k containing Q with x{Q) G £j. 

To show (DM-1), take any cube Q = x{Q) + 3{Q — x{Q)) G ■ 
Then x{Q) = x{Q) G £j. The corner child Q' of Q is just x{Q) + 
3{Q' — x(Q)) where Q' is the corner child of Q. Since Q' and Q share 
the common corner, it follows that Q' G ^k+i Remark 2.3, 

that consists exactly of the dyadic children of the cubes in . 

Finally, the last statement in property (b) is equivalent to the last 
assertion of the theorem. □ 

We want to emphasize that the key non-trivial part of the argument 
above is hidden not in the verihcation of some particular property: each 
of those is fairly straightforward. Neither is it hidden in some inge¬ 
nious step or in combining the intermediate statements in some fancy 
ways, though the order of steps does matter if one wants to run the 
argument in a smooth way and without unnecessary repetitions. The 
crucial point is the very possibility to extend the equivalence relation 
that naturally arises when looking at one generation to all other gen¬ 
erations and, most importantly, across generations in one wide sweep. 
It is this extension across generations, which requires the coprimality 
of the numbers 2 and 3 to work and which is ultimately responsible for 
making it possible to combine the individual tilings coming from dif¬ 
ferent generations into hnitely many complete dyadic multiresolutions 
(the reader is encouraged to try to draw a picture similar to Figure 5 
to see what will go wrong if some even number is used in place of 3). 

4. The forest structure on a subset of a dyadic lattice 

Let ^ be a dyadic lattice. Let 5 C ^ be some family of dyadic 
cubes. We will view S as the set of vertices of a graph F^. We will 
join two cubes Q,Q' G iS by a graph edge if Q' C Q and there is 
no intermediate cube Q" G S (i.e., a cube such that Q' ^ Q" ^ Q) 
between Q and Q'. 

Though we prefer to view F^ as a non-oriented graph, there are two 
natural directions of motion on F^: up (from smaller cubes to larger 
ones) and down (from larger cubes to smaller ones). 

Note that the graph F^ cannot contain any cycle because the “low¬ 
est” (smallest) cube Q in that cycle would have to connect to two 
different cubes Q' and Q" of larger size. Then Q’ H Q" D Q 0, so 
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either Q' D Q'\ or Q" D Q', but in each case one of the cubes cannot 
be connected to Q because another one is on the way. 

As usual, we can dehne the graph distance ds{Q', Q") between two 
cubes Q', Q” G 5 as the number of graph edges in the shortest path 
from Q' to Q" (if there is no such path, we put d(Q', Q") = +cxo). 

Note that if Q' D Q", then we can ascend from Q" to Q' in the 
full dyadic lattice 3) in hnitely many steps, each of which goes from a 
cube to its parent. Since the total ascent is hnite, we can meet only 
hnitely many cubes in S on the way. Moreover, any path from Q" to 
Q' in goes through every cube in this ascent. These three simple 
observations have several useful implications, which we will now state 
for future reference. 

1) If Q',Q" G S and Q' ^ Q", then ds{Q',Q'') is hnite and equals 
1 plus the number of intermediate cubes in S between Q' and Q". 

2) Let R E 3. Assume that there is at least one cube in S con¬ 
taining R. Then we shall call the smallest dyadic cube in S containing 
R the iS-roof of R and denote it by Rg. Note that Rg is also the hrst 
cube in S we meet on the upward path in T^ starting from R. 

The cubes R E 3 that have no iS-roof are contained in no cube 
from S. All other cubes naturally split into houses 

Hg{Q) = {RE3-.Rg = Q] 

of cubes located under the same roof Q E S (see Figure 8). The cubes 
in Hg{Q) are the cubes one can reach on the way down from Q in 
T^ before meeting another cube in S. Note that each cube R E S is 
its own roof and the next cube in S on the way up from R (the cube 
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connected to R by an edge in F^) is not the iS-roof of i?, but the iS-roof 
of the parent of R. 



Figure 8. The roofs Q E S (red intervals under blue 
triangles) and the houses Hs{Q) (with red walls). The 
house of the top interval is highlighted in green. 

3) If for every compact set K C M"' there exists a cube Q E S 
containing K (we will call such families regular), then T^ is connected 
and 3! = ^QdsHsiQ)- 

4) If Q' D Q” and Q\ Q" E S' C S, then ds^Q', Q") ^ ds{Q', Q"). 
This is also true for the case when the cubes are disjoint, provided that 
ds'{Q',Q") < +C)0, but we will never need this fact, so we leave its 
proof to the reader. 

5) If Q,Q' E S, R E HsiQ), and R D Q', then d^{R,Q') ^ 
ds{Q,Q'). This follows from the fact that to reach R from Q' one 
needs to pass through all intermediate cubes Q" E S on the way up. 

6) If Q G iS and Qi, ■ ■ ■ ,Qm £ S are some subcubes of Q with 
dsiQi Qi) = ■ ■ ■ = dsiQi Qm), then the cubes Qi,, Qm are pairwise 
disjoint. 


5. Stopping times and augmentation 

5.1. Stopping times. Suppose that we have some condition (a 
boolean function V : {(Q, Q') E 3 x 3 : Q D Q'} -E {true, false}) that 
may hold or not hold for every pair of dyadic cubes Q ^ Q', but such 
that V{Q,Q) is always true. 

Take any cube Q C 3. We say that a cube Q' C Q can be reached 
from Q in one step if V{Q, Q') fails but P(Q, Q") holds for all Q" with 
Q' ^ Q" C Q. We say that a cube Q' <Z Q can be reached from Q in m 
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steps if there is a chain Q = Qo D Qi D ■ ■ ■ D Qm = Q' in which Qj+i 
can be reached in one step from Qj{j = 0,l,...,m — 1). The system of 
all cnbes that can be reached from Q in hnitely many steps (inclnding 
Q, which can be reached from itself in 0 steps) is called the system of 
stopping cnbes associated with the initial cnbe Q and condition V and 
denoted stop(Q,P) or, merely, stop((5) if the condition is clear from 
the context. 


Figure 9. The system stop(Q,P) (thick intervals) for 
the condition “Q and Q' are of the same color”. 

The graph rstop(Q,p) is a tree with the top cube Q. The cubes 
Q' that can be reached from Q in m steps are exactly those with 
dstop{Q,'P){Q 1 Q ) nr. 

The most important feature of the system stop(Q,P) is that for 
every dyadic subcube R C Q, its roof Rstop{Q,v) is well-dehned and 
satishes V{Rstop{Q,r)^ R) = true. Indeed, since Q G stop(Q), and R C 
Q, the stop(Q)-roof of R exists. On the other hand, if we descend from 
Rstop{Q,v) to R, the condition V(Rstop(Q,v)i R') should hold for all cubes 
R' on the way (including R). Otherwise the hrst cube R' on that way 
down for which it fails can be reached from Rstop{Q,v) in one step, so 
it also belongs to stop((5,T’) and contains R. By the dehnition of the 
roof, we then must have R' = Rstop{Q,v)-i bnt this is impossible because 
R {,Rstop{Q , 1 ^) 1 Rstop{Q ^v)) true. 

Thus, the whole set ViQ) of subcubes of Q is a disjoint union of the 
houses Hstop{Q,v){Q')^ Q' ^ stop(Q,P) and for every Q' G stop(Q,P) 
and R G hfstop(Q,p)(QO) i'h® condition V{Q',R) holds. 

Another useful property is self-similarity: if Q' G stop(Q,P), then 
the family of all subcubes of Q' in stop(Q,P) is exactly stop(Q',P). 

5.2. Augmentation. Suppose that S is any subset of ^ and that 
with every cube Q E S some family J-{Q) of dyadic subcubes of Q 
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is associated so that Q G J^{Q). The augmented family S is then 
the union UQg5J^((5) where J^{Q) consists of all cubes Q' E J^{Q) 
that are not contained in any R E S with R ^ Q (see Figure 10). 
The augmented family S always contains the original family S but, in 
general, can be much larger. 




H—I—I——I——I—I I — I — i i i 


I I I A l \j I I I I mVA A-I I I I I I I ^ I I ^ I I I I I H-H 

Figure 10. The families R{Q) (all blue rectangles) and 
J-{Q) (solid blue rectangles) for an interval Q (the top 
interval) in S (red triangles). 

An important case of augmentation is when R{Q) = stop(Q, V) for 
some condition V. In this case, the augmented system S contains S 
and enjoys the property that V{Rg, R) holds for every R E Indeed, 
Rg E stop((5, P) for some Q E S. Then R^ = Rstop{Q,p) because the 
only way the two could be different is that Rstop{Q,p) would be removed 
as a cube contained in some Q' ^ Q from S, but in that case Q' would 
block all ways up from R to R{Q) as well. 

6. Sparse and Carleson families 

Let ^ be a dyadic lattice. 

Definition 6.1. Let 0 < r; < 1. A collection of cubes S c ^ 
is called r^-sparse if one can choose pairwise disjoint measurable sets 
Eq (Z Q with \Eq\ ^ r]\Q\ {Q E S). 

Definition 6.2. Let A > 1. A family of cubes 5 C ^ is called 
A-Carleson if for every cube Q E ^ we have 

I^Kaiqi. 

P€S,PCQ 
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6.1. The equivalence of the Carleson and sparseness con¬ 
ditions. It is almost obvious that every r^-sparse family is ^-Carleson. 
Namely, we can just write 

Z ibfKv'ioi. 

Pes,PcQ Pes,P(iQ 

That every A-Carleson family is ^-sparse is, however, much less obvious 
(which was one of the reasons for the duplication of terminology in the 
first place), so our hrst goal will be to show exactly that. 

Lemma 6.3. If S is A-Carleson, then S is -^-sparse. 


Proof. It would be easy to construct the sets Eq if the lattice 
had a bottom layer Then we would start with considering all 

cubes Q G iS n and choose any sets Eq C Q of measure ^|Q| 
for them. After that we would just go up layer by layer and for each 
cube Q G S n 3>k,k ^ K, choose a subset Eq of measure ^IQI in 
Q \ U_Rg 5 ,_RCQL'i?. Note that for every Q E S, we have 


U 

ReS,R^Q 



ReS,R'^Q 


A- 1 
A 


|Q| = (l-l)|Q|, 


SO such choice is always possible. 

The problem with the absence of a bottom layer is that going down 
in a similarly simple way is not feasible. A natural idea is to run the 
above construction for each K = 0,1, 2,... and then pass to the limit, 
but we need to make sure that the resulting subsets do not jump around 
wildly. Fortunately, it is not that hard. All we have to do is to replace 
“free choice” with “canonical choice”. 

Fix iF ^ 0. For Q G iS fl iflk^K^k) define the sets E^q^ inductively 

as follows. If Q G iS n set E^q"^ = x{Q) + A“^/"'(Q — x{Q)) (the 
cube with the same corner as Q of measure \E^q'^\ = ;y|Q|). For Q G 
iS n fc < JF, the sets E^q'^ will not necessarily be cubes. Namely, if 

E^^^ are already defined for R G iSn(Ufc+is:is;i^^i), then for Q G SnS^k, 
set 

4*‘) = fWuP(q) + ((q-x(q))), 

where 

U 

R€S,R^Q 

and f G [0,1] is the largest number for which the set 

C’ = l^(Q) + ‘(Q - “^(Q))l \ ■f’?’ 

satisfies \E^'^\ ^ y|Q| (see Figure 11). 
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Figure 11. The construction from the bottom level 
(brown) to 4 levels up (yellow). For the largest cube 
Q ^ S shown, the set is the total colored area, the 

set is the yellow area, and the set E^'^ is the area 
colored with colors other than yellow. 


The above considerations (together with the continuity and mono¬ 
tonicity of the function t |[a^(Q) + t{Q — x{Q))] \ E^'^\) imply that 
such t exists and we have \E^q\ = ;y|Q|. 

We claim that E^q^ C E^q^^^ for every Q G iS fl Indeed, 

if Q G &K, then E^q'^ is just the cube with corner x{Q) of volume y |Q|. 

On the other hand, contains a cube with the same corner for 

which the volume of the difference between that cube and some other 
set is as large. It remains to note that out of two cubes with the same 
corner the cube of larger volume contains the cube of smaller volume. 

From here we can proceed by backward induction. Assume that 
E^q^ C for every Q G iS O {'<^k<ii^K^i)- Take any Q G iS fl ^k- 

Then, the induction assumption implies that E^'^ C Let Q' = 

x{Q)+t'{Q — x{Q)) be the cube added to E^'^ when constructing E^\ 
Then 

SO the value of t that is chosen in the construction of is not less 

than t'. Thus, the cube that we add to when building 

contains the cube that we add to Ej^'^ when building Ej^\ and the 
claim follows. 
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Now, for Q G iS n &ki define 

OO 

K=k 

For each K, we have 

Note now that the sets also form an increasing (in K) sequence, 
so for each Q E S, the limit set 

£« = Jto<> = £,\(jtoFW)=g,\( U E,) 

R&S,R'^Q 

exists, is contained in Q and has the required measure. Finally, Eg 
are, obviously, disjoint. □ 

The sparseness property is something that can be readily used when 
working with systems of cubes that are already known to be sparse 
while the Carleson property is something that can be easily verihed in 
many cases where the sparseness condition is not obvious at all. For 
example, it is clear straight from the dehnition that the union of N 
Carleson systems with constants Ai,..., A^r is a Carleson system with 
constant Ai + • ■ • + Aat, while to see directly that the union of r^^-sparse 
systems is ? 7 j"^)“^-sparse is next to impossible. 

6.2. Anti-Carleson stack criterion. We will now present a sim¬ 
ple criterion for a system of cubes to be Carleson. 

Definition 6.4. Let Q E ^ and r] e (0,1). An ? 7 -anti-Carleson 
stack of height M with the top cube Q is a hnite family F C T>{Q) 
containing Q and such that 

Q'(iTAFQ',Q)=M 

Figure 12 depicts an anti-Carleson stack of hight 3 with rj ps 0.7. 
Note that the picture is layered primarily according to the distance djF, 
so some longer intervals are drawn lower than some shorter ones, which 
is opposite to how they would appear in the standard generation by 
generation layout of 

Note that if iS C ^ is some family of cubes and we have an r^-anti- 
Carleson stack C iS of height M with the top cube Q, then 

F = {Q' E S ■. Q' G Q, ds{Q, Q') < M} 

is also an ? 7 -anti-Carleson stack of height M. This is an immediate 
consequence of 3 observations: 
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Figure 12. An anti-Carleson stack T of height 3 with 
the top interval Q shown in red. The intervals Q' ^ T 
with djr{Q,Q') = 1,2,3 are shown in green, bine, and 
orange respectively. 


1) since C S, every cnbe Q' E with djr{Q, Q') = M is contained 
in some Q" G with ds{Q, Q") = M; 

2) for every Q" G T, ds{Q, Q”) = d^{Q, Q"); 

3) in any family of dyadic snbcnbes of Q, the cnbes with some hxed 
distance to the top cnbe Q are pairwise disjoint. 

Hence, 


>)IQI « I*?' 

Q’&T,dT{Q,Q')=M 


U Q' 

Q'eT,dT{Q,Q')=M 


U Q" 

Q"eJ^,ds(Q,Q")=M 


E i«' 

Q”(iT,df{Q,Q'')=M 


Our next observation is that for any hxed > 0, a Carleson family 
cannot contain an r^-anti-Carleson stack of arbitrarily large height 
M. Indeed, since for k ^ 1, each cube Q' with djr{Q,Q') = k is 
contained in some Q” with djr{Q, Q”) = /c — 1, we have 

E iQ'i > ’)i«i 

Q'^T^djr{Q^Q')=k 


for every fc = 0,1,..., M, from where it is clear that if T is contained 
in a A-Carleson family of cubes, we must have (M + 1)?7 ^ A. 

We will now prove a statement going in the opposite direction. 


Lemma 6.5. Let g (0,1). If S <Z ^ contains no rj-anti-Carles on 
stack of height M, then S is Carleson. 
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Proof. First of all, notice that S is A-Carleson if and only if every 
finite subsystem S' of S is A-Carleson. Second, notice that the A- 
Carleson condition for S can be checked for cubes Q E S only. Indeed, 
if it holds for every cube Q E S, take any cube R E S' and consider 
the family R of all cubes Q E S with Q <Z R that are not contained in 
any other cube Q' E S contained in R. Those cubes are disjoint and 
every other subcube of R from S is contained in one of them, so we 
can write 

E i»'i 


establishing the A-Carleson property for every R E 

Now take any hnite S' C S. By the assumption of the lemma, 
S' contains no r^-anti-Carleson stack of height M. Observe that ev¬ 
ery hnite family is Carleson (with Carleson constant not exceeding the 
number of cubes in the family, say). Let A be the best Carleson con¬ 
stant for S'. Let Q E S'. Let Rk{Q) be the set of all subcubes Q' E S' 
of Q with ds'{Q,Q') = k. Then ^ every cube 

from any Rj. with k ^ M is contained in some Q' E Rm- Thus, taking 
into account that the cubes in R^ are disjoint for each k, we get 

M-l 

= E E i«i+ E E w 

k=0 ReTk Q'&Tm R&S',R(ZQ' 

^ MIQI+A ^ \Q'\ ^ M\Q\ + riA\Q\. 

Q'&Tm 

Since this inequality holds for all Q E S', we get A ^ M + rjA, i.e., 
A < □ 

^ l-T? 

6.3. Improving the Carleson constant by partitioning the 
family. Some estimates are easier to carry out when the Carleson and 
sparseness constants are close to 1. Usually, these estimates are of 
such nature that if they are obtained for iSi,..., Sj^, then they trivially 
follow for the union S = iSi U • • ■ U Sj^. So, one may be tempted to 
try to split a A-Carleson system with large A into several 5-Carleson 
systems with S < A (preferably just slightly above 1). The following 
lemma shows that it is always possible. 

Lemma 6.6. If S is a A-Carleson system and m is an integer ^ 2, 
then S can be written as a union of m systems Sj, each of which is 
1 -|- Carleson. 


E i«i 

R&S',RcQ 


= E E 

Q'€RW£S,WcQ' 

^ AIQ'I = a| IJ g'l < A|I?|, 

Q'€R Q'dR 
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Proof. As we saw above, we can check the Carleson condition 
for any family 5' C ^ on the cubes Q G S' only. Now consider 
the graph P^. It may be disconnected, but then every 2 cubes from 
different connected components are disjoint, so it will suffice to split 
each connected component into 1 + ^^^-Carleson systems separately. 
Thus, we can assume that T^ is connected, which means that any 
upward paths starting from two (or more) cubes in S merge at some 
cube containing both (all) of them. 

Now we can dehne an equivalence relation on S as follows: take any 
two cubes Q', Q" G S and hnd any cube R E S that contains both of 
them. We say that Q' is equivalent to Q" if ds{Q',R) — ds{Q",R) is 
divisible by m. Note that if we have some other cube R' E S containing 
both Q' and Q", then either R C R', or R' C R, so 

ds{Q',R') = ds{Q',R)±ds{R,R') 

respectively and the same is true for Q”. Thus, the difference ds{Q', R) — 
ds{Q", R) does not depend on the choice of R G> Q' U Q". 



Figure 13. The equivalence classes (with m = 3) shown 
on the graph T^ in green, orange, and yellow. 

Let iS' C iS be one of the equivalence classes with respect to this 
relation. Take Q E S' and consider the families 

Rk = {Q' eS -.Q' c g, dsiQ', Q) = k}. 

Then, by dehnition, the set {Q' E S' : Q' G Q} is the union of Rq = 
{Q}, Rm, R2m, etc. We also have 

E i«'i E i«'i 

Q'GJ'fc Q'&Tml 
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for every k = m{l — 1) + 1,, ml. Thus, 

OO OO 

E i«'i ^ i<2i+™E E i«'i 

k=0 Q'GJ^k ^—1 Q'sJ~ml 

= \Q\+m IQ'I, 

Q'eS',Q%Q 

whence 

IQ'I « IQI + —iQi, 

m 

Q'&S',Q'(lQ 

proving the claim. □ 

6.4. Augmentation of Carleson families. 

Lemma 6.7. The augmentation of a Ao-Carleson family S by A- 
Carleson families J^{Q) is a A(Ao + 1)- Carleson family. 

Proof. Take any cube Q E The subcubes of Q can appear in 
the augmented system S either from iF{Qs) or from some iF{Q') with 
Q' G V^Q) n iS. Since iF{Qs) is A-Carleson, we have 

E I^KAIQI- 

ReT{Qs),RcQ 

On the other hand, 

5^ iq'kaoAm. 

Q'eS,Q'cQ R&T{Q') Q'GS,Q'CQ 

Adding these inequalities, we get the desired bound. □ 

Since we will often need to deal with stopping time augmentations 
of Carleson families, it would be nice to find some easy to use criterion 
for a condition V to be of Carleson type in the sense that for every 
Q E the family stop(Q,P) is Carleson with uniformly controllable 
Carleson constant. 

A simple sufficient condition is the following. 

Lemma 6.8. Assume that for every cube Q E 3l and any pairwise 
disjoint Q[,..., Q'^ C Q such that V{Q, Qb) fails for each j, one has 
Y.j \Qj I ^ ri\Q\ with some rj < 1. Then stop(Q,V) is -^-Carleson for 
every Q. 

Proof. This condition rules out r^-anti-Carleson stacks of height 1 
in stop((5,7^). It remains to apply Lemma 6.5. □ 
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7. Prom the theory to applications 

The Carleson families already lie on the border between “basic” and 
“intermediate” tools, so we shall stop the general theory here and pass 
to an example of how dyadic techniques can be applied to some prob¬ 
lems in analysis that do not contain any dyadic lattices or anything 
else like that in their original formulations. The particular applica¬ 
tion we chose for this short book is the weighted norm inequalities for 
multilinear Calderon-Zygmund operators. We will restrict ourselves to 
the most classical part of the theory and just prove the sharp bounds 
in terms of the joint Muckenhoupt norm of the weights alone leaving 
without comment more rehned estimates involving Aoo-norms. Still, 
we hope that a reader who does not know this particular subject in 
and out will hnd the exposition both accessible and instructive. 


8. The multilinear Calderon-Zygmund operators 

The most classical, known, and loved singular integral operator is 
the Hilbert transform on the real line: 

fiy) 


( 8 . 1 ) 


Hf{x) = 


x-y 


-dy. 


The function y i-4 is not integrable near x, so one can apply this for¬ 
mula verbatim without any reservations only if x is outside the (closed) 

support of /. Nevertheless, it turns out that some tweaking (consider- 

r -F f \ 

ing the principal value lim / - —dy, using Fourier transform, 

J{y.\x-y\>e} ^ - V 

etc.) allows one to show that H can be made sense of as a nice bounded 
operator on every L^(M) with 1 < p < oo, and even if / G (which is 
the harshest case for making sense of the integral formula (8.1)), then 
Hf is hnite almost everywhere and satishes the weak type estimate 

|{x e M : \Hf{x)\ > a}\ ^ —||/||li 


a 


for every a > 0. 

The most straightforward 
the Riesz transform: 

Rf{x) = j 


analogue of the Hilbert transform is 


X 


y 


X 


y\ 


n+l 


f{y)dy. 


It is often very convenient to view Rf as a vector-valued function, but 
one can always project to any direction e G M” and consider 

y,e) 


Ref{x) = 


[X 


\x - y\ 


n+l 


f{y)dy 


instead. 
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In general, we can consider arbitrary operators 
Tf{x)= f K{x,y)f{y)dy 

with the so-called Calderon-Zygmund kernels. The development of the 
classical non-weighted theory of linear Calderon-Zygmund operators 
(even for the special case of the Hilbert transform) is beyond the scope 
of this book. An interested reader can hnd a good exposition in [3, 4, 
13]. We will just note here that the main 3 properties that make the 
whole theory work are the following. 


(i) The scale and shift invariant estimate of the kernel: 


\K{x,y)\ ^ 


C 

X — 1/|” 


(which makes t^K{tx,ty) as good (or as bad) as K, so if we 
conjugate the operator action by an affine change of variable 
X a + tx, the entire theory stays invariant). 

(ii) Some reasonable (and scale and shift invariant) continuity of 
the kernel away from the diagonal; if Q is any cube, x', x” G Q 
and y ^ Q\^t\ = c{Q) + t{Q — c{Q)) with f ^ 2, then 

\K{x\y) - K{x",y)\ ^ 

where p : [0, -|-cxo) —)■ [0, -1-cxo) is some modulus of continu¬ 
ity (continuous, increasing and subadditive function such that 
p(0) = 0) that tends to 0 not too slowly near 0. Note that it 
is almost always required that the same condition holds with 
the roles of x and y exchanged, but for our current purposes 
this part will suffice. 

(iii) Some cancellation in the kernel (antisymmetry, zero average, 
etc.). This part is crucial for the classical theory but here, 
where we just take all statements of the classical non-weighted 
theory for granted, we are not concerned with it in any way. 

A multilinear (m-linear) Calderon-Zygmund operator takes in m 
functions fi,..., fm and its kernel K{x,yi,..., ym) depends on m -|- 1 
variables x,yi,... ,ym G M". The result is dehned as 


T[fi, . . . , frn]{x) 



K{x,yi,..., ym)f{yi) ■ ■ ■ f{ym)dyi... dym 


when the integral makes sense (for a set i? C i?” we use a notation 
= E X ■■■ X E) and the kernel is allowed to have a singularity only 
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ioT yi = ■ ■ ■ = Um = X. The scale and shift invariant bonnd will then 
be 


\K{x,yi,.. .,ym)\ ^ 


C 


(max \x — yi\Y'^ 

i 

and the corresponding continuity property can be stated as 


\K{x',yi,...,ym) - K{x'',yi,...,ym)\ ^ 


C 






whenever x\x'' E Q, t ^ 2, and there exists i such that yi ^ Q[t]. 

The only nontrivial result of the classical theory we shall need below 
is the weak type bound 


KieB": |r[/i,...,/™lWI >a}| « c 


1/m 




2=1 


which we will just postulate. 

At last, a few words should be said about the sense in which the 
action of T in various weighted spaces of functions is understood. 
The weak type property is, of course, formally contingent upon that 
T[/i,..., /„] is well-dehned for any /j G as an almost everywhere 
hnite function. Still, a function from an arbitrary weighted space does 
not need to be even in without some restrictions on the weight. 
What saves the day is that there is a linear space of functions that is 
contained in L} as well as in any weighted U’ and dense there, namely 
the space of bounded measurable functions with compact support. 
So, we will use it to take an easy way out: we will say that T is bounded 
as an operator from some product of weighted L^-spaces to another 
weighted L^-space if the corresponding inequality for the norms holds 
for all test functions /* G Thus, we may always assume that all 
our test functions below belong to though for most arguments it is 
not really necessary. 


9. Controlling values of T[/i,..., fm] on a cube 

Our hrst goal will be to get an efficient pointwise estimate of g = 
T[fi,...,fm] in terms of averages \ Mq = Jq \fi\ of functions fi over 
various dyadic cubes. 

The decay of the kernel K{x,yi,..., ym) at inhnity is too weak to 
make the trivial bound 

P m 

lfi'(a^)l ^ / \K{x,yi,...,y^)\W_\fi{yi)\dyi...dym 

really useful even when the integral is hnite. However, the continuity 
of the kernel in x allows one to estimate the difference of values of g at 
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two points x', x” in some cube Q by 

[l^[/lXOp], • ■ • , fmXQ[2^{x')\ + |T[/iXq,2,, . • ■ > fmXQp]]i^”)\ 

oo „ 

+ ^ / \K{x',y)-K{x",y)\-\F{y)\dy = Ii + l2, 

k=i 

where f= (|/i,..., i/^), F(y) = f{yi) ■ ■ ■ f{ym)- 
By the continuity property of K, 

OO OO m 

h « C^p(2-‘)|F|g» = C5 ;p( 2-‘) n 

/c=l /c = l 2=1 

Note now that the weak type bound for T implies that for the set 


Ga = {xeQ : \T[fiXQ[2]^ ■ ■ ■ JrnXQi2]]i^)\ > 


we have 


C 


Xjra 




l<2|. 


2=1 


Hence, taking a = C\ Hili I/jIqp]! obtain that \Ga\ ^ A|Q| and 
for every x', x" E E = Q \ Ga, 


\h\^2G\-^l[\f, 


dQp] • 


2=1 


Therefore, we have proved that for every cube Q, there exists a set 
E C Q with \E\ ^ (1 — A)|(5| and such that for all x',x" G E, 

OO m 

|9(x') - g(x'')\ < C(A) ^p(2-‘) n , 

fc=0 i=l 

This phenomenon merits a few dehnitions and a separate discussion. 


10. Oscillation and A-oscillation 

Let / be a measurable function dehned on some non-empty set E. 
The oscillation of / on T' is just 

^(/; E) = sup / - inf /, 

E ^ 

i.e., the length of the shortest closed interval containing f{E). The 
oscillation is hnite when / is bounded on E. 

Assume now that / is measurable and finite almost everywhere on 
some measurable set Q C M"' of finite positive measure. Let A G (0,1). 
The A-oscillation of / on Q is defined as 

uxif; Q) = inf{u;(/; E):EcQ, \E\ ^ (1 - A)|g|}. 
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Note that, unlike the usual oscillation a;(/; Q), the A-oscillation Q) 
is hnite for any measurable function / that is finite almost everywhere 
on Q, which makes it a much more flexible tool. 

The first thing we note is that the inhmum in the definition of 
uj\{f-iQ) is, actually, a minimum as the following lemma shows. 

Lemma 10.1. For every A G (0,1), there exists a set E G Q of 
measure \E\ ^ (1 — A)|Q| with ux{f-, Q) = oj{f] E). 

Proof. There are subsets E^ C Q with \Ek\ ^ (1 — A)|Q| and 
intervals Ik = [ak,ak + u{f-, Ek)] C M such that f{Ek) C h and 
u){f;Ek) I uxif'jQ)- The crucial point is that we cannot have more 
than pairwise disjoint intervals Ik^ with this property because oth¬ 
erwise 

IQI ^ I V /-‘(4,)| = |r‘(4,)| > - A)|Q| = |Q|, 

j 

Thus, {ofe} is a bounded sequence and, by Bolzano-Weierstrass, passing 
to a subsequence, if necessary, we can assume without loss of generality 
that Ofc —)■ a as fc —)■ oo. 

Now just put E = limsupfc_,.oo L’fc = Hn^i ^k^n Ek. Note that 
\E\ ^ (1 — A)|Q| and for every x E E there is a subsequence kj such 
that 

% ^ f{x) ^ Ofc. +uj{f;Ekj) 
for all j, and, thereby, 

a < f{x) ^ a + uJxif;Q)- 


□ 


Observe that the set E C Q in Lemma 10.1 does not need to be 
unique. For example, let Q = [0,1) and f{x) = x. Then 

ux{f;Q) = l-X = u{f-E,,x) 

for all intervals Ea^x = [a, a -|- 1 — A) with 0 ^ a ^ A. 


10.1. The oscillation chain bonnd for a measurable func¬ 
tion. We are now going to capitalize on the trivial observation that 
if El,... ,Ek is a chain of sets such that Ej 0 ^ 0 for all j = 

1,..., k — 1, then for every x G Ei, y G Ek, one has 

k 

\f{x)- f{y)\ ^ '^uif^Ej). 

i=i 
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Theorem 10.2. Let / : M” —)■ M any measurable almost every¬ 
where finite function such that for every e > 0, 

(10.1) \{x e [-R,Rr ■ \ f{x)\ > e}| = o{R^) as R ^ oo. 

Then for every dyadic lattice Ss and every A G (0,2“"'“^], there exists 
a regular^ 6-Carleson family S <Z 3> (depending on f) such that 

I/I ^ 5^^a(/;Q)xq 

Q^S 

almost everywhere. 

Proof. For every cube Q G fix a set E{Q) C Q such that 
\E{Q)\ ^ (1 —A)|Q| and a;(/; T'((5)) = u}x{f] Q). We say that two cubes 
Q D Q' are linked if E{Q) fl E{Q') 0. Clearly, if Qat D • • • D 

is a chain of nested cubes in which Qj+i and Qj are linked for every 
j = 1,..., iV — 1, then 

N 

u{f; E{Qfi U ■ ■ • U E{Qm)) ^ 5^a;(/; E{Qfi). 

i=i 

We call a family S linked if every two cubes Q' C Q” in S with 
ds{Q', Q") = 1 are linked. Assume that S is linked and regular. Then, 
starting from any cube Q E S, we can go up and enumerate all cubes 
from S we meet on the way: Q = Qi G Q 2 .... Take a point 
X G E{Q). For every ^ 1, we have 

N-l 

\fix)\ ^'^u}{f-,E{Qj)) + sup |/|. 
j=l E{Qn) 

However, condition (10.1) implies that for every e > 0, E{Q]sf) inter¬ 
sects the set {I/I < e} if the cube Qat is sufficiently large, so in this 
case, 

sup I/I ^ e + u{f-,E{QN)). 

E(Qn) 

Since e was arbitrary, we conclude that then |/(x)| ^ '^(/j ^i.Qj))i 

which can be restated as 

I/I ^ ^^x{f]Q)XQ 

QgS 

on Uq(zsE{Q). 

Now it becomes clear how the system S should be constructed. We 
should just first make sure that the entire lattice is linked and then to 
rarefy it as much as possible by removing the intermediate cubes in long 

^Recall that a family 5 C is regular if for each compact K C M", there exists 
Q E S containing K. 
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linked descending chains when a direct shortcut is available. The hrst 
requirement is equivalent to having each cube Q linked to every of its 
children. However, for every child Q' of Q, we have E{Q), E{Q') C Q 
and 

|h;(Q)| + |h;(g')l^(i-A)(i + 2-")|Q| 

so this requirement is automatically satished as soon as 

(l-A)(l + 2-'^) > 1. 

It may not be clear which cubes are essential and which are unnec¬ 
essary for keeping ^ or its subsets linked if we look at each cube alone 
and ask about its role in the entire lattice. However, we can certainly 
locate some cubes Q' that seem like good candidates for removal if we 
start with some cube Q ^ & and try to “comb” the lattice ^ starting 
with Q and going down. Then, at each moment when we consider some 
particular cube C Q as a candidate for removal, we still have the 
entire family E^Q') untouched and linked by its parent-child bonds. 
Thus, at this moment, the maximal linked descending chains starting 
at Q and passing through Q' will then pass through one of the children 
of Q', so we can safely bypass Q' in all those chains if each child of Q' 
is linked directly to Q or to some intermediate cube between Q and Q' 
that survived the previous combing. 

Hence we just dehne the condition V{Q, Q') by saying that V{Q, Q') 
is satished if every child oi Q' <Z Q is linked to Q (see Figureld). 



Figure 14. V{Q,Q'i) holds because every child of Q'^ 
is linked to Q, i.e., the corresponding sets encircled in 
brown intersect E{Q) (red), but V{Q,Q 2 ) fails, so the 
removal of Q 2 may disconnect the chain linking Q to the 
bottom right child of Q' 2 . 
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Combing the lattice down from some fixed cube in the manner 
described above is exactly what our stopping time tool does, so it is 
natural now to take a look at the families stop(Q,P). We would like 
them to be well-dehned and Carleson, so we have to check 2 things: 

1) V{Q, Q) always holds. This was proved just a few lines ago when 
we talked about having the entire lattice linked. 

2) 'P{Q)Q') is a Carleson-type condition. Indeed, suppose that 
Q'j is a family of pairwise disjoint subcubes of Q such that V{Q,Q'j) 
fails for all j. Then for each Q'^ there exists its child Q” such that 
E{Q'') n E{Q) = 0. Since we still have E{Q'-) C Q'- C Q, we obtain 

on 

Ei«;i = 2”Ei«"i<rvAEw")i 

3 3 3 

on on\ 1 

provided that A ^ 

Now just take any regular 2-Carleson family (say, the family of all 
cubes in ^ containing the origin), and augment it by stop(Q,n). By 
Lemmas 6.7 and 6.8, the resulting family S is 6-Carleson. Also, it has 
the property that V{Qs,Q) holds for every Q E This can be used 
in two ways. 

Suppose that Q,Q' E S, Q 0 Q', and ds{Q', Q) = 1. Then Q is the 
iS-roof of the parent Q of Q', so V{Q, Q) holds, i.e., Q is linked to Q' 
(as well as to the dyadic brothers of Q', but that part is fairly useless). 
Thus S is linked. 

Another way to use this condition is to take any weak Lebesgue 
point X G M"" of / and, for hxed e > 0, consider a cube R E & 
containing x such that 

\{yER-. \f{y) - f{x)\ > e}| < A|i?|. 

Let P be the parent of R. Then Pg E S, so we have 

I/I ^ 5^wa(/;Q)xq 

on E{Ps). However, we also have Ps linked to all children of P and, 
in particular to R, so 

I/I ^ Q)xq + R) 

QgS 

on E{R). Next, observe that u\{f]R) ^ 2e and the sets E{R) C R 
and {y E R : \f{y) — f{x)\ ^ e} intersect, provided that A < |, so we 
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finally conclude that 

i/wi« E ^a(/;Q)xq + 36. 

Since £ > 0 was arbitrary, the statement of the theorem follows. □ 

11. From a sum over all cubes to a dyadic sum 

The result of Section 9 can be restated as 

oo m 

(11.1) uj,(Tlfu . Q) ^ C(A) E J] l/ilQ„.+., • 

k=0 i=l 

Since the weak type bound implies that the measure of the set {x G 
M" : |T[/i, ...,fm\\ > £} is hnite for every e > 0 and /i,..., e 
we can apply Theorem 10.2 to estimate T[/i,...,/m] pointwise. A 
slight nuisance is that while in the resulting double sum Q runs over 
a Carleson subset iS of a hxed dyadic lattice the averages |/i|Qpfe+i] 
are taken over non-dyadic cubes. However, the Three Lattice Theorem 
allows one to dominate the right hand side of (11.1) by 3” dyadic sums 
without any effort: note that for each /c, the cube Q[ 2 >=+i] can be covered 
by a cube in one of the lattices (see Remark 3.2) of comparable size 

and, since the sidelengths of the cubes Qpfc+i] increase as a geometric 
progression, each cube R G can be used as an extension of Q[ 2 fc+i] 
only hnitely many times. At last, the ratio i-q/in is comparable with 
^Q/^Qpfe+i] ~ 2“^“^, so, since the ratio p{t)/p{ct) is uniformly bounded 
in t > 0 for every hxed c G (0,1), we get 

3" m 

.... /J; Q) « C{n, A) E E pVq/^p) n 

j=l R^^U) i=l 

RDQ 

and, by Theorem 10.2, 

3" m 

|r|/i,...,/„]KC(n,A) EE E 

j=l QG(S i=l 

RDQ 

almost everywhere. 

Now recall that for a hxed j, every cube Q G ^ is contained in a 
unique cube Qj G with = 3 £q. Since every Qj can arise as 
an extension cube for at most 3"^ cubes Q and since the sets E{Q) in 
the dehnition of the sparse family S can perfectly well serve to show 
the sparseness of the family Sj = {Qj : Q E S} only with a 3"^ times 
smaller sparseness constant, we conclude that for each set of functions 
and each A G (0, 2“”'“^], there exist 3" dyadic lattices 
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and 3" families Sj C each of which is A-Carleson with A = 6 • 3” 
such that 
( 11 . 2 ) 

3" m 

\T[h,...,U]\^C{n,\) EE E 

j=l QGSj R^^U) i=l 

pointwise. 


12. A useful summation trick 

In this section we describe a convenient way to estimate a double 
sum of the kind 

EE. 

Qe5 

RDQ 

where ^ is a dyadic lattice and S is an arbitrary regular subset of 

The way the sum is written means that we take all cubes in S one 
by one and start looking up taking all dyadic cubes R on the way into 
account. We want to rewrite it so that the direction we look in changes 
from up to down but the outer sum is still over S, not over This 
seems impossible until we realize that every cube i? G ^ has a roof 
Rs G S, after which the task becomes trivial: 

EE = EE E =E E ■ 

Qe 5 fle® Qg 5 ues UeS Qes,ReHs(u) 

RDQ UDQ QcR 

We shall now use this summation rearrangement with summands 
of the kind a{d^{Q, R))'^{R)xq, where a : [0, oo) is some rea¬ 

sonably fast decreasing function and T is some non-negative quantity, 
which, after some change of notation, is exactly the kind of expression 
we have in (11.2). 

We observe, as before, that 

EE a{d^{Q,R))^iR)XQ = E E a(d^(g,i?))T(i?)xQ. 

QeS fle® UeS Qes,ReHg(u) 

RDQ QCR 

Since R is now restricted to Hs{U), we can try to carry supj:jgj|^^((y) •i{R) 
out of the inner sum and look at the resulting bound 

^ sup ^(R)^ a{d^{Q,R))xQ. 

UqS QCS,ReHg(U) 

QCR 

It is now tempting to try to sum over R in the inner sum taking into 
account that for a hxed Q, all distances d^{Q, R),R E ^,R D Q, are 
pairwise different, so we can get ^^e worst case scenario. 

This is not a bad idea and we will do exactly that with one small but 
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essential modification: we will notice that the condition R G Hs{U) is 
much stronger than just R E ^ and, since the cube Q is also in S and 
Q C R C U, the distance d^{R,Q) from R G HsiU) to Q can never 
be less than ds{U,Q). Thus, splitting all possible cubes Q G U m S 
according to their distance from t/ in T^, we get the bound 

E E a{d^{Q,R))XQ 

m^O QeS,R€Hg(u) 

QCR<ZU,dg(Q,U)=m 

for the inner sum. 

Now, for fixed m, the sum 

a{d^{Q,R)) 

ReHg(U) 

RDQ 


is still a sum of values of a at distinct integers, but at this point we 
also know that none of them is less than m. Thus, we get the bound 
this sum instead of the sum over all I ^ 0. This may 
look like a small improvement for each particular Q, but it becomes 
dramatic when we sum over Q and take into account that for any fixed 
m, 

xq ^xu 

QGS,Q<ZU 

(Q,U)=m 

(the cubes Q in the summation are pairwise disjoint and are all con¬ 
tained in U). We can now write 


Y a{d^{Q,R))XQ ^Y 

m^O QgS,Rgh^(U) m^O QGS,d^{Q,U)=Tn l^m 

Q(ZR(ZU,d^{Q,U)^m QCU 


m^O l^m 


1^ + l)a(m) 


Xu- 


m^O 


If /9 = + l)a(m) < -l-cxo, then we get the final estimate: 


o(<ia(Q, R))'i(R)xQ 

Qg 5 «e® U€S 

RDQ 


sup T(i?) Xt/. 

RGHsiU) '' 


13. Prom Calderon-Zygmund operators to sparse operators 


The summation trick allows us to estimate T[fi,, fm] by a sum 
of 3" expressions of the kind 


Qg5 


m 

IMr^xq 

i=l 
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(one for each lattice under the additional assumption that 

OO 

mp(2"™) < +CX), 

m=l 

or, equivalently, jY{t)\ogjY < +cxo. All classical operators have 
pit) = for some 5 > 0, so making this assumption hardly restricts 
the generality of the results we present. 

Now we want to go one step further and replace the supremum by 
a single average. Of course, this is impossible if we keep the original 
family S intact, but, since the only thing we know (or care) about is 
that S be Carleson, we can extend S in any way we want to a new 
family S before applying the summation trick as long as the extension 
preserves the Carleson property, and here is where the augmentation 
tool comes handy. 

No matter how we extend, the top cube Q of HgiQ) will always 
remain in the supremum. So, our task will be to make sure that for 
every R G H§iQ), the product \ fi\R is controlled by l/ib- 
This is exactly what the stopping time construction does if we choose 
the condition V in an appropriate way. The most natural condition 
ViQ, Q') to introduce is the one that 

m m 

i=l i=l 


with some (large) K ^ 1. The products may be a bit unpleasant to 
handle directly, so we will ask for a bit more. Namely we will say that 
ViQ,Q') holds if l/ilg' ^ for every i = 1,... ,m. Then, for 

the augmented system S, we will automatically have 


sup 


^=1 i=l 


for every Q G S and our sum will get reduced to 

m 

En IMqxq- 

QG<S*=1 


Since we want to keep the family S Carleson, we need to make sure 
that our condition V is of Carleson type. Suppose that Q' C Q are 
disjoint cubes for which V{Q,Q'j) fails. Then for each j, there exists 
i{j) e {!,...,m} with lo; > /C'''”|/.0 )|q. Let Ji be the set of j 
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for which i{j) = i. Then 

A-'''’"l/ilQ5^IO'l < = llil 

j€Ji j&Ji 

^ f\M = \MQ\Ql 

JQ 

whence \Q'j\ ^ K~^/'^\Q\ for each i and Yhj \Q'j\ ^ mK~^/'^\Q\. 

Thus, choosing K = (2m)"*, we get the desired Carleson type property 
for V. 

Now our strategy for the estimate is clear. First, take each double 
sum 

m 

X/ n l/iUXQ, 

Q^Sj R£^U) i=l 

R^Q 

and replace each Carleson family Sj by the augmented family Sj, which 
is 2(6 ■ 3" + l)-Carleson by Lemmas 6.7 and 6.8. We now have 


sup 

ReH^ (Q) 


ni/.i«« (2mrnii-ie 

2=1 2=1 


for all Q E Sj and since Sj C Sj, this replacement can only enlarge the 
double sum. After that, we apply the summation trick, and, assuming 
that p satisfies the condition p{t) log — < +cxo, get the sum 

m 

En l/ilgXQ 

Q&Sj 

dominating the original double sum up to a constant factor. 

This observation is worth singling out as a theorem. To state it 
clearer, let us make one definition first. 


Definition 13.1. Let ^ be any dyadic lattice and let 5 C ^ 
be a Carleson family of cubes. The m-linear sparse operator As,m 
corresponding to the family S is defined by 

m 

As ,771 [fl, ■ ■ ■ ,fm] — ]^(//)qXQ- 

Q€Si=l 


Theorem 13.2. LetT be any m-linear Calder on-Zygmund operator 
with the modulus of continuity p of the kernel satisfying 

/ M ^dt 

/ PWlog-y<+00. 
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Then, for every fi, ■ ■ ■, fm, there exist 3" dyadic lattices Q^d 14-3"- 
Carleson families Sj C such that 

gn 

im. ■ ■ ■./™ii < cm , i/„i] 

j=i 

almost everywhere. The constant C(T) depends on n,m,p, and the 
constant in the weak type hound for T. 


14. Elementary weighted theory of sparse multilinear forms 


The result of the previous section shows that, when building the 
theory of weighted norm inequalities for Calderon-Zygmund operators, 
we can (at least, as a first approximation) forget about the Calderon- 
Zygmund theory altogether and just consider sparse operators As,m 
instead. How much information will be lost on this way? This ques¬ 
tion is still awaiting its answer because the £t is pretty tight but not 
perfectly tight. At this point we prefer to abstain from discussing this 
issue in the hope that more light will be shed on it in the near future. 

Now fix S and consider the simplest linear one-weight problem of 
hnding a necessary and sufficient condition for .4,5^ to be bounded on 
L^(u), where 0 < is some weight and p G (1, -|-cxo). Recall that 


II^IIlpm = sup 


Fgv 


■.geL^ 


Lp'(v) 


^ 1 


where p' = is the conjugate exponent. Thus, we are interested in 
establishing an inequality of the kind 



for arbitrary non-negative functions f,g. 


14.1. The renormalization trick. Now, we see one interesting 
thing: the integrals in the -norm of g on the right and in the average 

{gv)Q are taken with the same weight, but it is not so for /. Can we 

restore the symmetry between / and g in this respect? The answer is 
“yes”. What we need to note is that if w is any positive function, then 
fw runs over all nonnegative functions as / does so, but the integrals on 
the left and on the right sides scale differently in w. So, our inequality 
can be rewritten as 

^U'w)Q{gv)Q\Q\ f riw^v)) ' ( [ gP'v) ' . 

^ ./R" ^ ^ JR" ^ 
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Now, to restore a perfect symmetry between / and g, we would like 
to find w so that w = w^v. Solving this equation gives 
or ta pv = 1. Thus, the one-weight linear inequality is equivalent 
to the 2-weight bilinear one 


^{fiWi)Q{f2W2)Q\Q\ ^ k( f ‘ ( [ /fwa) ^ 


with — = 1 and the extra relation = 1 (o,- = 1 — —) for the 

weights. 

The problem in this form, if we ignore for a moment all relations 
between Wj and pj, naturally generalizes to m weights and functions. 
To cover a few interesting cases that are not immediately transparent 
from the consideration of the one-weight linear problem alone, we intro¬ 
duce a few more parameters and state the general multilinear weighted 
norm inequality as 

( m \ m « , 

n(/.»><)s ioKA'n(/ 

i=\ ) i=i 


with Pi > 0,pi> 1. The reader who still wonders why it isn’t 

Pi! Pi 


Q^S V^—1 


101 < A'J] 


i=l 




should recall the renormalization trick above and try carrying out the 
reduction of this seemingly more general version to (14.1) by himself. 


15. A digression: the dyadic maximal function 


For a weight w and a measurable set E C M” denote w{E) = 

Let ^ be a dyadic lattice. Given a weight w, dehne the weighted 
dyadic Hardy-Littlewood maximal operator by 




1 


sup — 

Qe&:Q^x W 


iQ) 



\f\w- 


The following result is known as the Hardy-Littlewood dyadic maximal 
theorem. 


Theorem 15.1. The maximal operator satisfies the following 
properties: 

w{x e M'" : M^f{x) > a} ^ -||/||li(«,) (a > 0) 

a 


l|V*/|tL,(„) ^ (1 < P oo). 


(16.1) 

and 

(15.2) 
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Proof. Let C ^ be any finite family of cubes. Consider the 
restricted maximal function 

M^f = I Jglflw.QeT^QBx}, X e 


0 , 


otherwise. 


By the monotone convergence theorem, it suffices to prove (15.1) and 
(15.2) for M^. 

For a > 0, let 

flo = {x G M” : f{x) > a}. 


Then Hq is just the union of the maximal cubes Qj G T with the 
property that \f\w > aw{Qj). Since Qj are disjoint, we get 


(15.3) 


j 



\f\w- 


This, obviously, implies the weak type bound for . 

To get the LP(tc)-bound for 1 < p < oo (the remaining case p = oo 
is obvious), just integrate (15.3) with the weight pa'^~^: 


\\M^f\\lp(n^) = pI ^w{Vta)da^p j M / \f\w \ da 




,p-2 


= P 


a^ ^|/(x)|tc(x) dxdQ! 




{(q!,x):0<o<M^ (a;)} 

P - 1 Jk- 
P 

p — 1 




P-1 

p 


\ 1/p 


Assuming that / is bounded and compactly supported (so all inte¬ 
grals in the last inequality are finite), we conclude that 



P 

p — 1 



For an arbitrary function / G U’{w), just consider the truncated func¬ 
tions 

\^\ <tAfi^)\ <t 

[O otherwise 

and use the monotone convergence theorem with t ^ oo. □ 
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16. A bound for weighted sparse multilinear forms 

We now return to the questions of when the inequality 

( m \ m p , 

YlifiWiYg ) IQI ^ ^ ’ 

i=l J i=l 

holds for all nonnegative /* G with some A > 0 and of how 

to hnd a good estimate for K in terms of some reasonable quantities 
that can be computed directly in terms of the weights Wi. We will 
make an additional assumption that YYaLi ^ ~ which is a direct 
generalization of the relation ^ + ^ = 1 in our main motivating bilinear 
case discussed above. 

The obvious necessary condition can be obtained by taking a cube 
Q E S and putting fi = xq for all i. Then, ignoring all terms in the 
sum on the left except the one corresponding to Q, we get 


!!("’■)§ ^ K 

i=l 

with qi = rj(l — for all Q E S. We call this condition the joint 
, gm)-Muckenhoupt condition for the system of weights Wi and 

denote 


[Wl, ..., Wm]s-,qu-,qm = S^P ' 

Note that if we want our bound to hold for all sparse multilinear forms 
corresponding to all possible sparse families S with hxed sparseness 
constant ?7 > 0, we need our condition to hold for all cubes Q C M”, so 
it is natural to dehne the full joint {qi,..., gm)-Muckenhoupt condition 
as 


[Wl, . . . , Wm]qu...,qm = SUp ' 

is a cube 

The obvious way to use the sparseness of S is to switch to the 
disjoint sets E{Q). Note that we have no guarantee that fi are not 
small on E{Q). However, we can be sure that the dyadic maximal 
functions satisfy M^Ji ^ E{Q). Thus, using that 
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1 ^ = 1 again, we obtain the trivial Holder bound 


(16.2) n 

i=\ 




ri/pi 


>n(i: 

Q^S' 


2=1 


{MZfiY’mi 


E{Q) 


ri/pi 




n(5:( 


i=l Qes 


{fiWi)Q 

K)q 


Pi \ 

w^{E{Q))) 


ri/pi 


m 


Q&S i=l 


UiWiTq 


[Wi 


^w,{E{Q)r/^K 


'Q 


By the Hardy-Littlewood dyadic maximal theorem, we see that the 
left hand side of (16.2) is dominated by the right hand side of (16.1). 
A good chunk of the classical theory of weighted norm inequalities can 
be derived from comparing the right hand side of (16.2) with the left 
hand side of (16.1). 

The direct multilinear analogue of the linear one-weight case is the 
situation where the weights Wi are related by Hlii '^T = 1- Write 


'^iJQ 


y2=l 


2=1 


2=1 


X 


/ m 

TT ( 

t {wi)Q yPpA 

Ml I 

\i=i 

Kw,{E{Q))) J 


The hrst factor is just the quantity on the right hand side of (16.2); 
the second factor is bounded by [tci,..., Wm\s-,qi,...,qmy the whole game 
is in getting a decent bound for the last factor. Fortunately, in the case 
under consideration, this factor cannot be too large. Put q = ?*• 

Then, by Holder’s inequality, 

m p m 

wmQW'''^ / Tlwl'" = \E(Q)\^r,\Q\. 

Je{q) 



Thus, we always have 


(16.3) 






Denoting ai{Q) = and (3i{Q) = ^ ^ Oi(Q), we see that the 


IQI 


product we need to estimate can be written as Y\T=i (o^(§) 


ri/pi 


and 
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the Muckenhoupt condition along with (16.3) implies that 



< 


n( 


IQI 

Wi{E{Q)) 


m 

nK)Q 




Since each ratio 


ft(Q) 

«i(Q) 


is at least 1, we conclude that 


TT C f^ijQ) 

Wq) 


i/pi 




max —— 
l^i^m Pi^i 


and, thereby, 


(16.4) 




7 

S-,qi,...,qm 


where 

7 = max (l H- —) = max (l H-—^) = max p'- 

' PiQi ^ pjrj(l — 

and 

f'i f'i 1 

T = q max -= q max -^^ = q max - . 

IsgisSm p^q^ pjrj(l - — ) Kisgrri Pj - 1 

Observe that the Lp{w) operator norm of blows up as p —)■ 1, 
but approaches 1 as p —)■ +cx3. Thus, despite the argument above has 
been carried out for finite p, the same inequality 


QeS i=l 


< 


Pi 


Ifvp.! 

i=l 


Ti 


[rCi, . . . , Wm\'s-qi,...,q^ TT II 


2=1 


can be obtained for the case when some of p* are equal to +oo either 
by repeating the proof, or just by passing to the limit. 

Note that the passage to the limit is not completely trivial because 
one has to be careful to make sure that all the related quantities, indeed, 
do tend to what one wants them to tend to and that the conditions 
Er=i = 1 ^nd nr= ^ wf = 1 hold all the way through. The easiest 
way to ensure that all the limits exist and are correct is to assume that 
S is hnite and all test functions are compactly supported and bounded, 
and to let the monotone convergence theorem take care of the general 
case. To preserve the conditions, one can put, for example, = 
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^ 1 (^1 - ^ j and ri{t) = with '0(t) = {1 - t + t X]™ i r*) ^ and 

t tending to 0+. 


17. The weighted L^-estimates 


We are now ready to consider the action of As,m from x 

• • • X LP’^{vm) to LP{v). Since we are ultimately aiming at transferring 
the result we will obtain to the case of multilinear Calderon-Zygmund 
operators, we will be interested in the uniform bounds for the whole 
family of sparse operators with hxed sparseness constant rj > 0 here, 
rather than in the action of each individual one like it was in the pre¬ 
vious section. 

Note that iS = is a 1-Carleson family, and that the corresponding 
sparse operators converge to the “trivial” multilinear operator 


as A; —)■ oo. Thus, to get a natural setup, we want to be sure that, at 
least. To acts as a bounded multilinear operator from L^^{vi) x • • • x 
LP^{vm) to LP{v). 

Observe that the product of m functions fi G LP^{vi) of norm 1 is 
an arbitrary function from the unit ball of L^{v) with p given by ^ = 

YlT=i ^ = YYiLi Thus, the best we can hope for is that the 

entire family of sparse operators As^m acts from (ui) x • • • x 
to U‘{v) with these particular p and v. If the families of weights Vi and 
powers Pi satisfy this property, then any other statement about the 
operator boundedness of As^m from Lp^{vi) x • • • x L'P'^iym) to some 
weighted T^-space either follows from it, or is false. 

We will now try to reduce this case to the estimates for weighted 
multilinear forms obtained in the previous section using duality and 
renormalization. Since the duality trick increases the number of func¬ 
tions and integrations by 1, the action of As,m in weighted spaces is 
equivalent to the boundedness of some weighted m + 1-form. Note also 
that for m > 1, we can easily have p < 1 even under our assumption 
that all Pi > 1, so the duality argument has to be applied with some 
caution. 

Assume hrst that p > 1. Then we can take Pm+i = = 1 

(so Qi = 1 — A for i ^ m and qm+i = “)• Note that with this 

choice, ^ ~ p A ^ The renormalization trick implies that 

\\As,m\\LPiivi)x-xLPm(vm)^Lp(v) < K H and only if the estimate 


m+1 


• • • : f m\ f m+1 ^ ^ n ii/<ii LPi {Wi)^ 


2=1 
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holds for all non-negative functions fi G L^^{wi) with wj = Vi[t = 
1 ,... ,m) and Wm+i = v. Observe now that in this case, the relation 


V = 


n 


2=1 




p/pi 


becomes Wm+i = i 


w, 


or w, 


i/p 

m+1 


n: 


m 
2 = 1 


Wi = 1 , 


which is exactly the additional relation we imposed on the weights 
when proving (16.4). Thus, the results of the previous section yield 


\\-^S,m\\LPl{vi)x---xLP^{vm)^LP(v) ^ Pi) ['^1; • • • ; 

with 

7 = , max p' = max(p;,... ,p(^,p). 

1^2^m+l 

In the case p ^ 1 , we put Pm+i = cxo, r* = p for i = 1 ,..., m, and 
Tm+i = 1- We still have — = y]T-^ — + — = 1 + 0 = 1. Now 

gj = p(l — +) for i ^ m, qm+i = 1 , and we see that to apply the results 

of the previous section, we need the relation Wm+i Y\T=i'^\ 
which is the same as before. Our inequality then becomes 


( m 

n(/i ) Q (/m+11) Q 

2 = 1 


IQI 


m 

^ 0(p,Pj)[u'i, . . . , ^ ll/*lll,Pi(iOi)) ll/m+l||L°°(«),„+l)- 

2=1 


or, equivalently, 


• • • 5 frn'^rn]'^7n-\-l 

m 

^ C'(p,p,)hi,.. n 

2 = 1 


where 

m 

7lp,*S,m[/l, . . . , /m] = ^ YIU^qXQ ■ 

QeS i=l 

Since p ^ 1 , we have As,m[fi, • • •, fm]^ < Ap,s,m[fi, ■ ■ ■, fm] pointwise 
for any non-negative functions fi. Hence, in this case, we get 


42I7TI + I ) 

m 

^ 0(p,Pi)[M;i, . . .,Wra+l] 

i=l 


with 

7 = max(p;,...,p(„,l). 

Note that the algebraic expressions for the numbers g* in the case 
p ^ 1 are exactly p times the corresponding expressions in the case 


50 


p > 1, SO writing everything directly in terms of pi, we see that the 
Muckenhoupt factor is 

]max(pp...,p;„,p) 

in both cases. 

By Theorem 13.2, these estimates can be immediately extended to 
multilinear Calderon-Zygmund operators and we obtain the following 



Theorem 17.1. LetT be a multilinear Calderon-Zygmund operator 
with the modulus p of continuity of the kernel satisfying 


< +00- 


Let Vi be any weights and Pi > 1 be any numbers (i = 1,... ,m). Define 
P > 0 by i = and put v = > '^rn+i = v, 


Pm 


Wi = V, 


• (i = 1,... ,m). Then 

\\T\\LPi{vi)x-xLP^{vm)^LP{v) ^ C{T,Pi)[wi, . . . 


= C{T,pi) 


sup 

is a cube 


1/p 

’<3 


ni 

2=1 


V, 


-v'ilviDiv'i 
)Q 


max(p',...,p;„,p) 


The only remark that remains to make to juxtapose our statement 
of this theorem with the way it is usually written in the literature is 

that supQc;Kn,Q is a cube 11™ 1 IS usually denoted by [v\//,. 

In particular, in the linear case, when v = v, P = p, we obtain 




max(^,l) 


Historical notes 

Multilinear Calderon-Zygmund operators in the form considered in 
this book probably first appeared in Coifman and Meyer [1], but their 
systematic study was started by Grafakos and Torres [5]. 

For a classical (linear) theory of Ap weights we refer the reader to 
Garcia-Cuerva and Rubio de Francia [3] and to Grafakos [4]. Mucken¬ 
houpt families of weights for multilinear operators were introduced in 
Lerner et ah 110], 

The main result of Section 10, Theorem 10.2, has been well known 
in the local form, i.e., for functions dehned on some cube Q C M” (see, 
for instance, Hytonen [7]) . 

The linear case of Theorem 17.1 with p = 2 has become known 
as the “A 2 conjecture”. It was hrst proved for the Hilbert transform 
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by Petermichl [ 12 ] and for general Calderon-Zygmund operators by 
Hytonen [6]. The approach based on dyadic sparse operators is due 
to Lerner [9], where a weaker form of Theorem 13.2 with the Banach 
function space norm estimate instead of the pointwise estimate was 
obtained. 

An alternative proof of Theorem 13.2 was given by Conde-Alonso 
and Rey in [ 2 ]. Later, Lacey [ 8 ] relaxed the condition on the modulus 
of continuity p to Jq < oo. 

The bounds for weighted sparse multilinear forms in Section 16 
were obtained by Li, Moen, and Sun [ 11 ]. Their ultimate goal in 
that paper was also to derive Theorem 17.1, but, lacking the pointwise 
bound given by Theorem 13.2, they were forced to stay within the cat¬ 
egory of Banach function spaces and to introduce the extra assumption 

p ^ 1. 
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